categorical attribute meaning in Chinese
类型属性
Examples
- This paper discusses the methods of similarity measurement of most clustering algorithms , and taking the type of attribute as a standard of choosing similarity , it expounds the methods used to measure numerical attribute , categorical attribute and mixed attribute
讨论了在大多数聚类算法中的相似性测量方法,并以属性的类型作为选择相似性的标准,阐述了用于数值属性,符号属性及混合属性相似性测量方法。 - For the sake of further improving performance , this paper made some improvements to sliq . first , we use a new splitting index to evaluate the “ goodness ” of the alternative splits for attributes instead of gini index . secondly , we regard categorical attributes with only two possible values as numeric attributes when evaluate splits
为了进一步提高分类准确率和速度,论文对sliq算法作了一些改进:用新的属性选择度量代替gini索引,用处理连续值属性的方法处理只有两个可能值的分类属性。 - The discussion of main parallel technologies on construction of parallel sliq algorithm is presented in this paper . the computing result of algorithm complexity of sequential and parallel algorithm indicates : when the data set is large enough , as to continuous attributes , the parallel algorithm almost get speedup value equal to the number of processors , while as to categorical attribute the improvement of parallel algorithm is limited
通过对串行和并行算法时间复杂度的计算表明,当数据集充分大时,由于连续属性的排序计算操作分散到各个处理机单元上进行,显著降低了计算时间,从而可以得到近似于处理机个数的加速比,对于离散属性,本并行算法对串行算法的性能提高有限 - Subsequently , clustering analysis in data mining is disserted , involving the methods and characteristics of clustering used in data mining and the methods for evaluating the clustering results , with emphasis on clustering the data with categorical attributes . k - modes clustering algorithm and its variations are introduced with their advantages and disadvantages
在此基础上对数挖掘中的聚类分析作以详细地论述,总结了数挖掘中聚类分析的方法和特点,并对聚类结果的评价方法进行了讨论,重点讨论了分类属性数据聚类,具体研究了k - modes算法及其变形,并指出了它们的优缺点。